Clustering permutations by Exponential Blurring Mean-Shift algorithm
نویسنده
چکیده
Suppose that a sample of people independently examine a fixed set of k items and then rank these items according to personal judgment. Whatever the nature of these items, each person produces a ranking. This paper aims at clustering people into different groups according to their preferences. We propose the exponential blurring mean-shift (EBMS) algorithm which shifts the rankings to new locations obtained by a locally weighted combination of all the data. The number of clusters does not need to be specified in advance and outliers can be detected. Our experiments show that the EBMS algorithm can be succesfully applied in clustering the ranking data. The algorithm generalizes to partial orderings when only the top-t ranks are observed.
منابع مشابه
An Exponential Model for Infinite Rankings
This paper presents a statistical model for expressing preferences through rankings, when the number of alternatives (items to rank) is large. A human ranker will then typically rank only the most preferred items, and may not even examine the whole set of items, or know how many they are. Similarly, a user presented with the ranked output of a search engine, will only consider the highest ranke...
متن کاملOn the Convergence and Consistency of the Blurring Mean-Shift Process
The mean-shift algorithm is a popular algorithm in computer vision and image processing. It can also be cast as a minimum gamma-divergence estimation. In this paper we focus on the “blurring” mean shift algorithm, which is one version of the mean-shift process that successively blurs the dataset. The analysis of the blurring mean-shift is relatively more complicated compared to the nonblurring ...
متن کاملOn mean shift-based clustering for circular data
Cluster analysis is a useful tool for data analysis. Clustering methods are used to partition a data set into clusters such that the data points in the same cluster are the most similar to each other and the data points in the different clusters are the most dissimilar. The mean shift was originally used as a kernel-type weighted mean procedure that had been proposed as a clustering algorithm. ...
متن کاملA review of mean-shift algorithms for clustering
A natural way to characterize the cluster structure of a dataset is by finding regions containing a high density of data. This can be done in a nonparametric way with a kernel density estimate, whose modes and hence clusters can be found using mean-shift algorithms. We describe the theory and practice behind clustering based on kernel density estimates and mean-shift algorithms. We discuss the ...
متن کاملMean shift algorithm for exponential families with applications to speaker clustering
This work extends the mean shift algorithm from the observation space to the manifolds of parametric models that are formed by exponential families. We show how the Kullback-Leibler divergence and its dual define the corresponding affine connection and propose a method for incorporating the uncertainty in estimating the parameters. Experiments are carried out for the problem of speaker clusteri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008